Integration of Semistructured Data with Partial and Inconsistent Information
نویسندگان
چکیده
Data integration of several sources has gained considerable attentions with the recent popularity of the Web. In the real world, some information may be missing (i.e., partial) and some may be inconsistent from several sources. How to obtain information as complete as possible and detect inconsistency from these sources is thus an interesting question. Most existing work uses a simple graph-based or tree-based semistruc-tured data model to represent heterogeneous data coming from various sites, which fail to account for the existence of partial and inconsistent information. In this paper, we redeene the notion of semistructured objects to reeect the existence of partial and inconsistent information and study how to integrate such objects spread in various sources and check consistency in the meantime. We propose a new operator integration for this purpose and discuss its semantic properties.
منابع مشابه
Adaptive Information Analysis in Higher Education Institutes
Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...
متن کاملQUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge
An important reality when integrating scientific data is the fact that data may often be “missing”, partially specified, or conflicting. Therefore, in this paper, we present an assertion-based data model that captures both value-based and structure-based “nulls” in data. We also introduce the QUEST system, which leverages the proposed model for Query-driven Exploration of Semistructured data wi...
متن کاملAdaptive Information Analysis in Higher Education Institutes
Information integration plays an important role in academic environments since it provides a comprehensive view of education data and enables mangers to analyze and evaluate the effectiveness of education processes. However, the problem in the traditional information integration is the lack of personalization due to weak information resource or unavailability of analysis functionality. In this ...
متن کاملIntegrating Diverse Information Management Systems: A Brief Survey
Most current information management systems can be classified into text retrieval systems, relational/object database systems, or semistructured/XML database systems. However, in practice, many applications data sets involve a combination of free text, structured data, and semistructured data. Hence, integration of different types of information management systems has been, and continues to be,...
متن کاملCritical Success Factors for Data Virtualization: A Literature Review
Data Virtualization (DV) has become an important method to store and handle data cost-efficiently. However, it is unclear what kind of data and when data should be virtualized or not. We applied a design science approach in the first stage to get a state of the art of DV regarding data integration and to present a concept matrix. We extend the knowledge base with a systematic literature review ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999